Missing Click History in Sponsored Search: A Generative Modeling Solution

نویسندگان

  • Özgür Çetin
  • Kannan Achan
  • Erick Cantu-Paz
  • Rukmini Iyer
چکیده

A fundamental problem in sponsored search advertising is the estimation of probability of click for ads displayed in response to search queries. The historical click-through rate (CTR) is one of the most important predictors of the click, and extracted at multiple resolutions of the query-ad hierarchy. However, the new ads do not have any click history, and even the existing ads might miss history at some resolutions due to, for example, tail queries. In addition to a loss in accuracy, the missing features introduce significant complexity in designing conditional probability of click models such as the maximum-entropy model. In this paper, we develop a generative modeling solution to handle missing features in the maximum-entropy and other conditional models. In particular, a mixture of multivariate Gaussian distributions is used to learn a representation of the CTR features. This mixture model then provides information about the missing features to the maximum-entropy model in increasing degrees of sophistication, ranging from the pointwise estimates of the missing features to multi-way interaction terms and to novel posterior features. We show the utility of this approach for sponsored click prediction using the click-view data collected from Yahoo! search engine. We find that the generative modeling approach not only improves click prediction accuracy over a state-of-the-art system, but also results in a significantly less complex system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Ensemble Click Model for Web Document Ranking

Annually, web search engine providers spend more and more money on documents ranking in search engines result pages (SERP). Click models provide advantageous information for ranking documents in SERPs through modeling interactions among users and search engines. Here, three modules are employed to create a hybrid click model; the first module is a PGM-based click model, the second module in a d...

متن کامل

Modeling Attractiveness and Multiple Clicks in Sponsored Search Results

Click models are an important tool for leveraging user feedback, and are used by commercial search engines for surfacing relevant search results. However, existing click models are lacking in two aspects. First, they do not share information across search results when computing attractiveness. Second, they assume that users interact with the search results sequentially. Based on our analysis of...

متن کامل

An Empirical Analysis of Search Engine Advertising: Sponsored Search in Electronic Markets

T phenomenon of sponsored search advertising—where advertisers pay a fee to Internet search engines to be displayed alongside organic (nonsponsored) Web search results—is gaining ground as the largest source of revenues for search engines. Using a unique six-month panel data set of several hundred keywords collected from a large nationwide retailer that advertises on Google, we empirically mode...

متن کامل

Click Fraud

T oday, Web search engines are the primary method for millions of users throughout the world to access information on a topic, navigate to Web sites, keep up with the news, and shop online. Most major search engines generate revenue via sponsored search, a process whereby content providers pay for traffic from specific links the search engine's display in response to user queries. Search engine...

متن کامل

The Business Next Door: Click-Through Rate Modeling for Local Search

Computational advertising has received a tremendous amount of attention from the business and academic community recently. Great advances have been made in modeling click-through rates in well studied settings, such as, sponsored search and context match. However, local search has received relatively little attention. Geographic nature of local search and associated local browsing leads to inte...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010